| Name | Version | Summary | Date |
| --- | --- | --- | --- |
| legen | 0.19.2 | Powerful toolkit that locally transcribes, translates, and masters subtitles for your media | 2025-11-04 00:45:07 |
| nulla | 0.0.5 | Nulla: a local AI companion bootstrapper (Windows) with voice (Whisper ASR + XTTS v2 TTS), llama.cpp + OpenHermes GGUF, and built-in mini-games. | 2025-11-02 18:55:59 |
| audio-subtitler | 0.1.2 | Convert audio files to subtitles (VTT, SRT) using Faster-Whisper | 2025-11-01 09:59:36 |
| localtranscribe | 3.1.2 | Privacy-first audio transcription with speaker diarization, labeling, and context-aware proofreading. Entirely offline. | 2025-10-31 04:55:55 |
| podlens | 1.2.17 | Intelligent Podcast & YouTube Transcription & Understanding AI Agent | 2025-10-29 22:47:48 |
| stream-translator-gpt | 2025.10.30 | Command line tool to transcribe & translate audio from livestreams in real time | 2025-10-29 20:32:14 |
| cued-speech | 0.4.1 | Cued Speech Processing Tools - Decode and generate cued speech videos | 2025-10-29 09:16:47 |
| vogent-turn | 0.1.1 | Lightweight turn detection library for conversational AI | 2025-10-28 00:28:20 |
| transub | 0.2.0 | CLI tool to transcribe and translate subtitles from videos | 2025-10-27 08:44:49 |
| framewise | 0.1.3 | AI-powered video tutorial assistant with intelligent frame extraction and multimodal RAG | 2025-10-19 00:25:34 |
| lecture-downloader | 1.1.10 | A comprehensive toolkit for downloading, merging, and transcribing lecture videos | 2025-10-17 01:35:08 |
| phonexia-grpc | 2.24.0 | Library for communicating with microservices developed by Phonexia using the gRPC application interface. | 2025-10-14 14:28:24 |
| claudecut | 0.1.3 | AI-powered video editing agent using Claude | 2025-10-09 15:54:59 |
| linto | 1.1.2 | Wrapper around LinTO Studio API | 2025-10-07 16:32:04 |
| gogadget | 0.3.3 | gogadget is a toolkit for producing immersion and priming materials for language learning. It is capable of downloading audio and video files, automatically transcribing subtitles from videos and podcasts, and automatically producing filtered Anki decks with sentence audio / translations / screenshots / definitions. | 2025-09-16 16:27:52 |
| faster-whisper-hotkey | 0.4.2 | Push-to-talk transcription | 2025-09-14 11:47:50 |
| pygpt-net | 2.6.45 | Desktop AI Assistant powered by: OpenAI GPT-5, GPT-4, o1, o3, Gemini, Claude, Grok, DeepSeek, and other models supported by Llama Index, and Ollama. Chatbot, agents, completion, image generation, vision analysis, speech-to-text, plugins, internet access, file handling, command execution and more. | 2025-09-13 23:25:46 |
| ispeak | 0.4.0 | A keyboard-centric inline speech-to-text CLI tool that works wherever you can type | 2025-09-11 16:47:18 |
| boio | 0.1.0 | An opinionated way of doing AI - Task-oriented AI nodes for general computing | 2025-09-08 17:15:02 |
| tubescribe | 0.3.4 | CLI to transcribe YouTube audio via Whisper (local) or Gemini (cloud) | 2025-09-08 08:06:33 |